Extending CLIP for Category-to-Image Retrieval in E-Commerce
نویسندگان
چکیده
E-commerce provides rich multimodal data that is barely leveraged in practice. One aspect of this a category tree being used search and recommendation. However, practice, during user’s session there often mismatch between textual visual representation given category. Motivated by the problem, we introduce task category-to-image retrieval e-commerce propose model for task, CLIP-ITA. The leverages information from multiple modalities (textual, visual, attribute modality) to create product representations. We explore how adding impacts model’s performance. In particular, observe CLIP-ITA significantly outperforms comparable only modality modality.
منابع مشابه
Deep Color Semantics for E-commerce Content-based Image Retrieval
This paper aims to develop a methodology to retrieve images based on fuzzy dominant colors expressed through linguistic descriptions. This process involves two steps: assigning fuzzy colorimetric profile to the image and processing the user query. People regard color as an aesthetic issue, especially when it comes to choosing the colors for their clothing, apartment design and other objects aro...
متن کاملCategory suggestion for e-commerce queries
This thesis proposes a category suggestion model for an e-commerce search engine to get an insight on the intent of the visitors. The dataset is provided by the Dutch e-commerce website Bol.com and consists of visitors queries and their subsequent clicks on items. The first part of the model is clustering the dataset to decrease the complexity and find previously unseen similarities between que...
متن کاملCategory-based image retrieval
This work presents a novel approach to content-based image retrieval in categorical multimedia databases. The images are indexed using a combination of text and content descriptors. The categories are viewed as semantic clusters of images and are used to confine the search space.
متن کاملExtending SAR Image Despckling methods for ViSAR Denoising
Synthetic Aperture Radar (SAR) is widely used in different weather conditions for various applications such as mapping, remote sensing, urban, civil and military monitoring. Recently, a new radar sensor called Video SAR (ViSAR) has been developed to capture sequential frames from moving objects for environmental monitoring applications. Same as SAR images, the major problem of ViSAR is the pres...
متن کاملCommerce Image Retrieval with Combination of Descriptors
It is a critical problem to find the commerce quickly and accuracy. Information from different sources can improve the retrieval performance. In this paper, we proposed a combination algorithm of color descriptor, LBP texture descriptor and HOG shape descriptor for commerce image retrieval. The commerce image retrieval experiments on commerce image dataset PI 100 indicate the combination can bo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-99736-6_20